DNA cloning vector plasmids and methods for their use

ABSTRACT

The present invention is a group of cloning vector plasmids for use in constructing DNA molecules, such as transgenes, for the purpose of gene expression or analysis of gene expression. The present invention is also a method for using the cloning vector plasmids in a variable series of cloning steps to produce a final transgene product. The plasmid cloning vectors are engineered to minimize the amount of manipulation of DNA fragment components by the end user of the vectors and the methods for their use. Transgenes produced using the invention may be used in a single organism, or in a variety of organisms including bacteria, yeast, mice, and other eukaryotes with little or no further modification.

[0001] This application claims the benefit under 35 U.S.C. 119(e) of provisional application 60/417,282, filed Oct. 9, 2002.

FIELD OF INVENTION

[0002] The present invention relates to the field of cloning vector plasmids, and to the use of cloning vector plasmids to build DNA constructs or transgenes.

DESCRIPTION OF THE PRIOR ART

[0003] The foundation of molecular biology is recombinant DNA technology, which can here be summarized as the modification and propagation of nucleic acids for the purpose of studying the structure and function of the nucleic acids and their protein products.

[0004] Individual genes, gene regulatory regions, subsets of genes, and indeed entire chromosomes in which they are contained, are all comprised of double-stranded anti-parallel sequences of nucleotides identified conventionally by the initials A, T, G, and C. These DNA sequences, as well as cDNA sequences derived from mRNA molecules, can be cleaved into distinct fragments, isolated, and inserted into a vector such as a bacterial plasmid to study the gene products. A plasmid is an extra-chromosomal piece of DNA that was originally derived from bacteria, and can be manipulated and reintroduced into a host bacterium for the purpose of study or production of a gene product. The DNA of a plasmid is similar to all chromosomal DNA, in that it is composed of the same A, T, G, and C nucleotides encoding genes and gene regulatory regions, however, it is a relatively small molecule comprised of less than approximately 30,000 base-pairs, or 30 kilobases (kb). In addition, the nucleotide base pairs of a double-stranded plasmid form a continuous circular molecule, also distinguishing plasmid DNA from that of chromosomal DNA.

[0005] Plasmids enhance the rapid exchange of genetic material between bacterial organisms and allow rapid adaptation to changes in environment, such as temperature, food supply, or other challenges. Any plasmid acquired must express a gene or genes that contribute to the survival of the host, or it will be destroyed or discarded by the organism since the maintenance of unnecessary plasmids would be a wasteful use of resources. A clonal population of cells contains identical genetic material, including any plasmids it might harbor. Use of a cloning vector plasmid with a DNA insert in such a clonal population of host cells will amplify the amount of the DNA of interest available. The DNA so cloned may then be isolated and recovered for subsequent manipulation in the steps required for building a DNA construct. Thus, it can be appreciated that cloning vector plasmids are useful tools in the study of gene fuction.

[0006] While some elements found in plasmids are naturally occurring, others have been engineered to enhance the usefulness of plasmids as DNA vectors. These include antibiotic- or chemical-resistance genes and a multiple cloning site (MCS), among others. Each of these elements has a role in the present invention, as well as in the prior art. Description of the role each element plays will highlight the limitations of the prior art and demonstrate the utility of the present invention.

[0007] A particularly useful plasmid-born gene that can be acquired by a host is one that would confer antibiotic resistance. In the daily practice of recombinant DNA technology, antibiotic resistance genes are exploited as positive or negative selection elements to preferentially enhance the culture and amplification of the desired plasmid over that of other plasmids.

[0008] In order to be maintained by a host bacterium a plasmid must also contain a segment of sequences that direct the host to duplicate the plasmid. Sequences known as the origin of replication (ORI) element direct the host to use its cellular enzymes to make copies of the plasmid. When such a bacterium divides, the daughter cells will each retain a copy or copies of any such plasmid. Certain strains of E. coli bacteria have been derived to maximize this duplication, producing upwards of 300 copies per bacterium. In this manner, the cultivation of a desired plasmid can be enhanced.

[0009] Another essential element in any cloning vector is a location for insertion of the genetic materials of interest. This is a synthetic element that has been engineered into “wild type” plasmids, thus conferring utility as a cloning vector. Any typical commercially-available cloning vector plasmid contains at least one such region, known as a multiple cloning site (MCS). A MCS typically comprises nucleotide sequences that may be cleaved by a single or a series of restriction endonuclease enzymes (hereafter referred to as “restriction enzymes”), each of which has a distinct recognition sequence and cleavage pattern. The so-called recognition sequences (which are referred to as restriction enzyme “sites”) encoded in the DNA molecule comprise a double-stranded palindromic sequence. For some restriction enzymes, as few as 4-6 nucleotides are sufficient to provide a recognition site, while some restriction enzymes require a sequence of 8 or more nucleotides. The enzyme EcoR1, for example, recognizes the hexanucleotide sequence:^(5′)G-A-A-T-T-C^(3′), wherein 5′ indicates the end of the molecule known by convention as the “upstream” end, and 3′ likewise indicates the “downstream” end. The complementary strand of the recognition sequence would be its anti-parallel strand, ^(3′)G-A-A-T-T-C-^(5′). Thus the double stranded recognition site can be represented within the larger double-stranded molecule in which it occurs as: ^(5′)......G-A-A-T-T-C......^(3′) ^(3′)......C-T-T-A-A-G......^(5′)

[0010] Like many other restriction enzymes, EcoR1 does not cleave exactly at the axis of dyad symmetry, but at positions four nucleotides apart in the two DNA strands between the nucleotides indicated by a “/”: ^(5′)......G/A-A-T-T-C......^(3′) ^(3′)......C-T-T-A-A/G......^(5′)

[0011] such that double-stranded DNA molecule is cleaved and has the resultant configuration of nucleotides at the newly formed “ends”: ^(5′)......G^(3′) ^(5′)A-A-T-T-C.......^(3′) ^(3′)......C-T-T-A-A^(5′)              ^(3′)G.......^(5′)

[0012] This staggered cleavage yields fragments of DNA with protruding 5′ termini. Because A-T and G-C pairs are spontaneously formed when in proximity with each other, protruding ends such as these are called cohesive or sticky ends. Any one of these termini can form hydrogen bonds with any other complementary termini cleaved with the same restriction enzyme. Since any DNA that contains a specific recognition sequence will be cut in the same manner as any other DNA containing the same sequence, those cleaved ends will be complementary. Therefore, the ends of any DNA molecules cut with the same restriction enzyme “match” each other in the way adjacent pieces of a jigsaw puzzle “match”, and can be enzymatically linked together. It is this property that permits the formation of recombinant DNA molecules, and allows the introduction of foreign DNA fragments into bacterial plasmids, or into any other DNA molecule.

[0013] A further general principle to consider when building recombinant DNA molecules is that all restriction sites occurring within a molecule will be cut with a particular restriction enzyme, not just the site of interest. The larger a DNA molecule, the more likely it is that any restriction site will reoccur. Assuming that any restriction sites are distributed randomly along a DNA molecule, a tetranucleotide site will occur, on the average, once every 4⁴ (i.e., 256) nucleotides, whereas a hexanucleotide site will occur once every 4⁶ (i.e., 4096) nucleotides, and octanucleotide sites will occur once every 4⁸ (i.e., 114,688) nucleotides. Thus, it can be readily appreciated that shorter recognition sequences will occur frequently, while longer ones will occur rarely. When planning the construction of a transgene or other recombinant DNA molecule, this is a vital issue, since such a project frequently requires the assembly of several pieces of DNA of varying sizes. The larger these pieces are, the more likely that the sites one wishes to use occur in several pieces of the DNA components, making manipulation difficult, at best.

[0014] Frequently occurring restriction enzymes are herein referred to as common restriction enzymes, and their cognate sequences are referred to as common restriction sites. Restriction enzymes with cognate sequences greater than 6 nucleotides are referred to as rare restriction enzymes, and their cognate sites as rare restriction sites. Thus, the designations “rare” and common” do not refer to the relative abundance or availability of any particular restriction enzyme, but rather to the frequency of occurrence of the sequence of nucleotides that make up its cognate recognition site within any DNA molecule or isolated fragment of a DNA molecule, or any gene or its DNA sequence.

[0015] A second class of restriction endonuclease enzymes has recently been isolated, called homing endonuclease (HE) enzymes. HE enzymes have large, asymmetric recognition sites (12-40 base pairs). HE recognition sites are extremely rare. For example, the HE known as I-SceI has an 18 bp recognition site (5′ . . . TAGGGATAACAGGGTAAT . . . 3′), predicted to occur only once in every 7×10¹⁰ base pairs of random sequence. This rate of occurrence is equivalent to only one site in 20 mammalian-sized genomes. The rare nature of HE sites greatly increases the likelihood that a genetic engineer can cut a final transgene product without disrupting the integrity of the transgene if HE sites were included in appropriate locations in a cloning vector plasmid.

[0016] Since a DNA molecule from any source organism will be cut in identical fashion by its cognate restriction enzyme, foreign pieces of DNA from any species can be cut with a restriction enzyme, inserted into a bacterial plasmid vector that was cleaved with the same restriction enzyme, and amplified in a suitable host cell. For example, a human gene may be cut in 2 places with EcoR1, the fragment with EcoR1 ends isolated and mixed with a plasmid that was also cut with EcoR1 in what is commonly known as a ligation reaction or ligation mixture. Under the appropriate conditions in the ligation mixture, some of the isolated human gene fragments will match up with the ends of the plasmid molecules. These newly joined ends can link together and enzymatically recircularize the plasmid, now containing its new DNA insert. The ligation mixture is then introduced into E. coli or another suitable host, and the newly engineered plasmids will be amplified as the bacteria divide. In this manner, a relatively large number of copies of the human gene may be obtained and harvested from the bacteria. These gene copies can then be further manipulated for the purpose of research, analysis, or production of its gene product protein.

[0017] Recombinant DNA technology is frequently embodied in the generation of so-called “transgenes”. Transgenes frequently comprise a variety of genetic materials that are derived from one or more donor organisms and introduced into a host organism. Typically, a transgene is constructed using a cloning vector as the starting point or “backbone” of the project, and a series of complex cloning steps are planned to assemble the final product within that vector. Elements of a transgene, comprising nucleotide sequences, include, but are not limited to 1) regulatory promoter and/or enhancer elements, 2) a gene that will be expressed as a mRNA molecule, 3) DNA elements that provide mRNA message stabilization, 4) nucleotide sequences mimicking mammalian intronic gene regions, and 5) signals for mRNA processing such as the poly-A tail added to the end of naturally-occurring mRNAs. In some cases, an experimental design may require addition of localization signal to provide for transport of the gene product to a particular subcellular location. Each of these elements is a fragment of a larger DNA molecule that is cut from a donor genome, or, in some cases, synthesized in a laboratory. Each piece is assembled with the others in a precise order and 5′-3′ orientation into a cloning vector plasmid.

[0018] The promoter of any gene may be isolated as a DNA fragment and placed within a synthetic molecule, such as a plasmid, to direct the expression of a desired gene, assuming that the necessary conditions for stimulation of the promoter of interest can be provided. For example, the promoter sequences of the insulin gene may be isolated, placed in a cloning vector plasmid along with a reporter gene, and used to study the conditions required for expression of the insulin gene in an appropriate cell type. Alternatively, the insulin gene promoter may be joined with the protein coding-sequence of any gene of interest in a cloning vector plasmid, and used to drive expression of the gene of interest in insulin-expressing cells, assuming that all necessary elements are present within the DNA transgene so constructed.

[0019] A reporter gene is a particularly useful component of some types of transgenes. A reporter gene comprises nucleotide sequences encoding a protein that will be expressed under the direction of a particular promoter of interest to which it is linked in a transgene, providing a measurable biochemical response of the promoter activity. A reporter gene is typically easy to detect or measure against the background of endogenous cellular proteins. Commonly used reporter genes include but are not limited to LacZ, green fluorescent protein, and luciferase, and other reporter genes, many of which are well-known to those skilled in the art.

[0020] Introns are not found in bacterial genomes, but are required for proper formation of mRNA molecules in mammalian cells. Therefore, any DNA construct for use in mammalian systems must have at least one intron. Introns may be isolated from any mammalian gene and inserted into a DNA construct, along with the appropriate splicing signals that allow mammalian cells to excise the intron and splice the remaining mRNA ends together.

[0021] An mRNA stabilization element is a sequence of DNA that is recognized by binding proteins that protect some mRNAs from degradation. Inclusion of an mRNA stabilization element will frequently enhance the level of gene expression from that mRNA in some mammalian cell types, and so can be useful in some DNA constructs or transgenes. An mRNA stabilization element can be isolated from naturally occurring DNA or RNA, or synthetically produced for inclusion in a DNA construct.

[0022] A localization signal is a sequence of DNA that encodes a protein signal for subcellular routing of a protein of interest. For example, a nuclear localization signal will direct a protein to the nucleus; a plasma membrane localization signal will direct it to the plasma membrane, etc. Thus, a localization signal may be incorporated into a DNA construct to promote the translocation of its protein product to the desired subcellular location.

[0023] A tag sequence may be encoded in a DNA construct so that the protein product will have an unique region attached. This unique region serves as a protein tag that can distinguish it from its endogenous counterpart. Alternatively, it can serve as an identifier that may be detected by a wide variety of techniques well known in the art, including, but not limited to, RT-PCR, immunohistochemistry, or in situ hybridization.

[0024] With a complex transgene, or with one that includes particularly large regions of DNA, there is an increased likelihood that there will be multiple recognition sites in these pieces of DNA. Recall that the recognition sequences encoding any one hexanucleotide site occur every 4096 bp. If a promoter sequence is 3000 bp and a gene of interest of 1500 bp are to be assembled into a cloning vector of 3000 bp, it is statistically very likely that many sites of 6 or less nucleotides will not be useful, since any usable sites must occur in only two of the pieces. Furthermore, the sites must occur in the appropriate areas of the appropriate molecules that are to be assembled. In addition, most cloning projects will need to have additional DNA elements added, thereby increasing the complexity of the growing molecule and the likelihood of inopportune repetition of any particular site. Since any restriction enzyme will cut at all of its sites in a molecule, if a restriction enzyme site reoccurs, all the inopportune sites will be cut along with the desired sites, disrupting the integrity of the molecule. Thus, each cloning step must be carefully planned so as not to disrupt the growing molecule by cutting it with a restriction enzyme that has already been used to incorporate a preceding element. And finally, when a researcher wishes to introduce a completed transgene into a mammalian organism, the fully-assembled transgene construct frequently must be linearized at a unique restriction site at at least one end of the transgene, thus requiring yet another unique site found nowhere else in the construct. Since most DNA constructs are designed for a single purpose, little thought is given to any future modifications that might need to be made, further increasing the difficulty for future experimental changes.

[0025] Traditionally, transgene design and construction consumes significant amounts of time and energy for several reasons, including the following:

[0026] 1. There is a wide variety of restriction and HE enzymes available that will generate an array of termini, however most of these are not compatible with each other. Many restriction enzymes, such as EcoR1, generate DNA fragments with protruding 5′ cohesive termini or “tails” ; others (e.g., Pst1) generate fragments with 3′ protruding tails, whereas still others (e.g., Bal1) cleave at the axis of symmetry to produce blunt-ended fragments. Some of these will be compatible with the termini formed by cleavage with other restriction and HE enzymes, but the majority of useful ones will not. The termini that can be generated with each DNA fragment isolation must be carefully considered in designing a DNA construct.

[0027] 2. DNA fragments needed for assembly of a DNA construct or transgene must first be isolated from their source genomes, placed into plasmid cloning vectors, and amplified to obtain useful quantities. The step can be performed using any number of commercially-available or individually altered cloning vectors. Each of the different commercially available cloning vector plasmids were, for the most part, developed independently, and thus contain different sequences and restriction sites for the DNA fragments of genes or genetic elements of interest. Genes must therefore be individually tailored to adapt to each of these vectors as needed for any given set of experiments. The same DNA fragments frequently will need to be altered further for subsequent experiments or cloning into other combinations for new DNA constructs or transgenes. Since each DNA construct or transgene is custom made for a particular application with no thought or knowledge of how it will be used next, it frequently must be “retro-fitted” for subsequent applications.

[0028] 3. In addition, the DNA sequence of any given gene or genetic element varies and can contain internal restriction sites that make it incompatible with currently available vectors, thereby complicating manipulation. This is especially true when assembling several DNA fragments into a single DNA construct or transgene.

[0029] Thus, there is a need for a system that would allow the user to rapidly assemble a number of DNA fragments into one molecule, despite redundancy of restriction sites found at the ends and within the DNA fragments. Such a system might also provide a simple means for rapidly altering the ends of the fragments so that other restriction sequences are added to them. Inclusion of single or opposing pairs of HE restriction sites would enhance the likelihood of having unique sites for cloning. A system that would also allow easy substitutions or removal of one or more of the fragments would add a level of versatility not currently available to users. Therefore, a “modular system, allowing one to insert or remove DNA fragments into or out of “cassette” regions flanked by rare restriction sites within the cloning vector would be especially useful, and welcome to the field of recombinant DNA technology.

BRIEF SUMMARY OF THE INVENTION BRIEF DESCRIPTION OF THE DRAWINGS

[0030]FIG. 1 is a linear map of the module concept of the invention.

[0031]FIG. 2 is a Docking Plasmid map.

[0032]FIG. 3 is a linear restriction map illustrating an example of restriction enzyme sites that can be included in the Docking Plasmid MCS.

[0033]FIG. 4 is a Shuttle Vector P (SVP) plasmid map

[0034]FIG. 5 is a linear restriction site map illustrating an example of restriction enzyme sites that can be included in the SVP MCS.

[0035]FIG. 6 is a Shuttle Vector E (SVE) plasmid map.

[0036]FIG. 7 is a linear restriction site map illustrating an example of restriction enzyme sites that can be included in the SVE MCS.

[0037]FIG. 8 is a Shuttle Vector 3′ (SV3) map.

[0038]FIG. 9 is a linear restriction site map illustrating an example of restriction enzyme sites that can be included in the SV3 MCS.

BRIEF DESCRIPTION OF SEQUENCE LISTING

[0039] SEQ:ID 01 is an example of a nucleotide sequence for a PE3 Docking Plasmid MCS.

[0040] SEQ:ID 02 is an example of a nucleotide sequence for a PE3 Docking Plasmid.

[0041] SEQ:ID 03 is an example of a nucleotide sequence for a Primary Docking Plasmid MCS.

[0042] SEQ:ID 04 is an example of a nucleotide sequence for a Primary Docking Plasmid.

[0043] SEQ:ID 05 is an example of a nucleotide sequence for a SVP Plasmid MCS.

[0044] SEQ:ID 06 is an example of a nucleotide sequence for a SVP Plasmid.

[0045] SEQ:ID 07 is an example of a nucleotide sequence for a SVE Plasmid MCS.

[0046] SEQ:ID 08 is an example of a nucleotide sequence for a SVE Plasmid.

[0047] SEQ:ID 09 is an example of a nucleotide sequence for a SV3 Plasmid MCS.

[0048] SEQ:ID 10 is an example of a nucleotide sequence for a SV3 Plasmid.

DEFINITIONS OF TERMS USED TO DESCRIBE THE INVENTION

[0049] As used herein, the terms “cloning vector” and “cloning vector plasmid” are used interchangeably to refer to a circular DNA molecule minimally containing an Origin of Replication, a means for positive selection of host cells harboring the plasmid such as an antibiotic-resistance gene; and a multiple cloning site.

[0050] As used herein, the term “Origin of Replication” (ORI) refers to nucleotide sequences that direct replication or duplication of a plasmid within a host cell

[0051] As used herein, the term “multiple cloning site” refers to nucleotide sequences comprising restriction sites for the purpose of cloning DNA fragments into a cloning vector plasmid.

[0052] As used herein, the term “cloning” refers to the process of ligating a DNA molecule into a plasmid and transferring it an appropriate host cell for duplication during propagation of the host.

[0053] As used herein, the term “DNA construct” refers to a DNA molecule synthesized by consecutive cloning steps within a cloning vector plasmid, and is commonly used to direct gene expression in any appropriate cell host such as cultured cells in vitro, or a transgenic mouse in vivo. A transgene used to make such a mouse can also be referred to as a DNA construct, especially during the period of time when the transgene is being designed and synthesized.

[0054] As used herein, the term “Shuttle Vector” refers to a specialized cloning vector plasmid used in the invention to make an intermediate molecule that will modify the ends of a DNA fragment.

[0055] As used herein, the term “Docking Plasmid” refers to a specialized cloning vector plasmid used in the invention to assemble DNA fragments into a DNA construct.

[0056] As used herein, the terms “restriction endonuclease” or “restriction enzyme” refers to a member or members of a classification of catalytic molecules that bind a cognate sequence of DNA and cleave the DNA molecule at a precise location within that sequence.

[0057] As used herein, the terms “cognate sequence” or “cognate sequences” refer to the minimal string of nucleotides required for a restriction enzyme to bind and cleave a DNA molecule or gene.

[0058] As used herein, the term “DNA fragment” refers to any isolated molecule of DNA, including but not limited to a protein-coding sequence, reporter gene, promoter, enhancer, intron, exon, poly-A tail, multiple cloning site, nuclear localization signal, or mRNA stabilization signal, or any other naturally occurring or synthetic DNA molecule. Alternatively, a DNA fragment may be completely of synthetic origin, produced in vitro. Furthermore, a DNA fragment may comprise any combination of isolated naturally occurring and/or synthetic fragments.

[0059] As used herein, the terms “gene promoter” or “promoter” (P) refer to a nucleotide sequence required for expression of a gene.

[0060] As used herein, the term “enhancer region” refers to a nucleotide sequence that is not required for expression of a target gene, but will increase the level of gene expression under appropriate conditions.

[0061] As used herein, the term “reporter gene” refers to a nucleotide sequences encoding a protein useful for monitoring the activity of a particular promoter of interest.

[0062] As used herein, the term “chromatin modification domain” (CMD) refers to nucleotide sequences that interact with a variety of proteins associated with maintaining and/or altering chromatin structure.

[0063] As used herein, the term “poly-A tail” refers to a sequence of adenine (A) nucleotides commonly found at the end of messenger RNA (mRNA) molecules. A Poly-A tail signal is incorporated into the 3′ ends of DNA constructs or transgenes to facilitate expression of the gene of interest.

[0064] As used herein, the term “intron” refers to the nucleotide sequences of a non-protein-coding region of a gene found between two protein-coding regions or exons.

[0065] As used herein, the term “untranslated region” (UTR) refers to nucleotide sequences encompassing the non-protein-coding region of an mRNA molecule. These untranslated regions can reside at the 5′ end (5′ UTR) or the 3′ end (3′ UTR) an mRNA molecule.

[0066] As used herein, the term “mRNA stabilization element” refers a sequence of DNA that is recognized by binding proteins thought to protect some mRNAs from degradation.

[0067] As used herein, the term “localization signal” (LOC) refers to nucleotide sequences encoding a signal for subcellular routing of a protein of interest.

[0068] As used herein, the term “tag sequence” (TAG) refers to nucleotide sequences encoding a unique protein region that allows it to be detected, or in some cases, distinguished from any endogenous counterpart.

[0069] As used herein, the term “primer site” refers to nucleotide sequences that serve as DNA templates onto which single-stranded DNA oligonucleotides can anneal for the purpose of initiating DNA sequencing, PCR amplification, and/or RNA transcription.

[0070] As used herein, the term “gene expression host selector gene” (GEH-S) refers to a genetic element that can confer resistance or toxicity to cells or organisms when treated with an appropriate antibiotic or chemical.

[0071] As used herein, the term “recombination arm” refers to nucleotide sequences that facilitate the homologous recombination between transgenic DNA and genomic DNA. Successful recombination requires the presence of a left recombination arm (LRA) and a right recombination arm (RRA) flanking a region of transgenic DNA to be incorporated into a host genome via homologous recombination.

[0072] As used herein, the term “pUC19” refers to a plasmid cloning vector well-known to those skilled in the art, and can be found in the NCBI Genbank database as Accession #L09137.

[0073] As used herein, the term “random nucleotide sequences” refers to any combination of nucleotide sequences that do not duplicate sequences encoding other elements specified as components of the same molecule.

[0074] As used herein, the term “unique” refers to any restriction endonuclease or HE site that is not found elsewhere within a DNA molecule.

DETAILED DESCRIPTION OF THE INVENTION

[0075] The present invention is a group of cloning vectors optimized to reduce the amount of manipulation frequently needed to assemble a variety of DNA fragments into a de novo DNA construct or transgene. The primary vector, herein referred to as a Docking Plasmid, contains a multiple cloning site (MCS) with 3 sets of rare restriction and/or HE sites arranged in a linear pattern. This arrangement defines a modular architecture that allows the user to assemble multiple inserts into a single transgene construct without disturbing the integrity of DNA elements already incorporated into the Docking Plasmid in previous cloning steps.

[0076] Two recognition sites for at least three HE are placed in opposite orientation to flank three modular regions for the purpose of creating a gene cassette acceptor site that cannot self-anneal. Because HE sites are asymmetric and non-palindromic, it is possible to generate non-complementary protruding 3′ cohesive tails by placing two HE recognition sites in opposite orientation. Thus, the HE I-SceI cuts its cognate recognition site as indicated by “/”: 5′...TAGGGATAA/CAGGGTAAT...3′ 3′...ATCCC/TATTGTCCCATTA...5′

[0077] The reverse placement of a second site within an MCS would generate two non-complementary cohesive protruding tails: 5′...TAGGGATAA CCCTA...3′ 3′...ATCCC AATAGGGAT...5′

[0078] This is particularly useful when it is necessary to subclone larger transgenes into a vector. Due to the size of the insert, it is thermodynamically more favorable for a vector to self anneal rather than accept a large insert. The presence of non-complementary tails generated by this placement of restriction sites provides chemical forces to counteract the thermodynamic inclination for self-ligation.

[0079] The asymmetric nature of most HE protruding tails also creates a powerful cloning tool when used in combination with the BstX I restriction enzyme site (5′CCANNNNN/NTGG 3′). The sequence-neutral domain of BstX I can be used to generate compatible cohesive ends for two reverse-oriented HE protruding tails, while precluding self-annealing. BstX I (I-Sce I Fwd.) I-Sce I Forward     I-Sce I Reverse BxtX I (I-Sce I Rev.) 5′-CCAGATAA     CAGGGTAAT//ATTACCCTGTTAT     GTGG-3′ 3′-GGTC TATTGTCCCATTA//TAATGGGAC AATACACC-5′

[0080] The secondary vectors of the invention, herein known as Shuttle vectors, contain multiple cloning sites with common restriction sites flanked by rare restriction and/or HE sites. The shuttle vectors are designed for cloning fragments of DNA into the common restriction sites between the rare sites. The cloned fragments can subsequently be released by cleavage at the rare restriction or HE site or sites, and incorporated into the Docking Plasmid using the same rare restriction and/or HE site or sites found in the shuttle vectors.

[0081] Thus, unlike conventional cloning vectors, the design of the MCS allows “cassettes” or modules of DNA fragments to be inserted into the modular regions of the Docking Plasmid. Likewise, each can be easily removed using the same rare restriction and/or HE enzymes, and replaced with any other DNA fragment of interest. This feature allows the user to change the direction of an experimental project quickly and easily without having to rebuild the entire DNA construct. Thus, the cloning vector plasmids of the present invention allow the user to clone a DNA fragment into an intermediate vector using common restriction sites, creating a cassette-accepting module, and to then transfer that fragment to the desired modular spot in the final construct by means of rare restriction sites. Furthermore, it allows future alterations to the molecule to replace individual modules in the Docking Plasmid with other cassette modules. The following descriptions highlight distinctions of the present invention compared with the prior art.

[0082] Individual components of a transgene (the promoter enhancer P, expressed protein E, and/or 3′ regulatory region 3) can be assembled as modules transferred from shuttle vectors into the PE3 Docking Station Plasmid. If higher orders of complexity are needed, the assembled transgenes, or other nucleotide sequences, can then be transferred into a Primary Docking Plasmid. Each of the five types of cloning vector plasmids will be explained in greater detail to provide an understanding of the components incorporated into each, beginning with the more complex PE3 Docking Station Plasmid and the Primary Docking Plasmid.

[0083] The PE3 Docking Plasmid (FIG. 2) comprises a pUC19 backbone with the following modifications, wherein the sequences are numbered according to the pUC19 Genbank sequence file, Accession #L09137:

[0084] 1. Only sequences from 806 to 2617 (Af13-Aat2) are used in the Docking Plasmid,

[0085] 2. The BspH1 site at 1729 in pUC19 is mutated from TCATGA to GCATGA,

[0086] 3. The Acl1 site at 1493 in pUC19 is mutated from AACGTT to AACGCT,

[0087] 4. The Acl1 site at 1120 in pUC19 is mutated from AACGTT to CACGCT,

[0088] 5. The Ahd1 site in pUC19 is mutated from GACNNNNNGTC to CACNNNNNGTC,

[0089] 6. Sequences encoding BspH1/I-Ppo 1/BspH1 are inserted at the only remaining BspH1 site in pUC19 following the mutation step 2 in the list above.

[0090] The multiple cloning site (MCS) in the PE3 Docking Plasmid (FIG. 3) comprises the following sequential elements, in the order listed:

[0091] 1. Three non-variable and unique common restriction sites that define a 5′ insertion site for the mutated pUC19 vector described above (for example, Aat II, Blp I, and EcoO109 I),

[0092] 2. A T7 primer site.

[0093] 3. A unique HE site (for example, I-SceI (forward orientation)),

[0094] 4. A pair of non-variable and unique, common restriction sites flanking random nucleotide sequences that can serve as a chromatin modification domain acceptor module (RNAS-CMD-1) (for example, Kpn I and Avr II),

[0095] 5. A fixed grouping of non-variable rare restriction sites that define the 5′ portion of the promoter module (for example, AsiS I and SgrA I),

[0096] 6. Random nucleotide sequences that can serve as a Promoter/intron acceptor module (RNAS-P),

[0097] 7. A fixed grouping of non-variable rare restriction sites that define the shared junction between the 3′ portion of the Promoter/intron module and the 5′ portion of the Expression module (for example, PacI, AscI, and MluI),

[0098] 8. Random nucleotide sequences that can serve as an expression acceptor module (RNAS-E),

[0099] 9. A fixed grouping of non-variable rare restriction sites that define the junction of the 3′ portion of the Expression module and the 5′ portion of the 3′ Regulatory module (for example, SnaB I, Not I, and Sal I),

[0100] 10. Random nucleotide sequences that can serve as a 3′ regulatory domain acceptor module (RNAS-3),

[0101] 11. A fixed grouping of non-variable rare restriction sites that define the 3′ portion of the 3′ Regulatory module (for example, Swa I, Rsr II, and BsiW I),

[0102] 12. A pair of non-variable and unique, common restriction sites flanking a random nucleotide sequence of DNA that can serve as a chromatin modification domain acceptor module (RNAS-CMD-2) (for example, Xho I and Nhe I),

[0103] 13. A unique HE site in reverse orientation that is identical to that in item 3, above,

[0104] 14. A T3 primer site in reverse orientation, and

[0105] 15. Four non-variable and unique common restriction sites that define a 3′ insertion site for the mutated pUC19 vector described above (for example, BspE I, Pme I, Sap I, and BspH I).

[0106] The Primary Docking Plasmid (FIG. 4) can be used to assemble two completed transgenes that are first constructed in PE3 Docking Station Plasmids, or two homology arms needed to construct a gene-targeting transgene, or to introduce two types of positive or negative selection elements. The multiple cloning site (MCS) in the Primary Docking Plasmid (FIG. 5) comprises the following sequential elements, in the order listed:

[0107] 1. Two non-variable and unique common restriction sites that define a 5′ insertion site for the mutated pUC19 vector described above (for example, Aat II and Blp I),

[0108] 2. An M13 Rev. primer site,

[0109] 3. A pair of unique HE sites in opposite orientation flanking a random nucleotide sequence of DNA that can serve as a genome expression host selector gene acceptor module (RNAS-GEH-S1) (for example, PI-SceI (forward orientation) and PI-SceI (reverse orientation)),

[0110] 4. A non-variable and unique, common restriction site that allows cloning of a shuttle vector module downstream of the HE pair (for example, Eco0109I),

[0111] 5. A fixed grouping of non-variable rare restriction sites that define the 5′ portion a Left Recombination Arm module (for example, SgrA I and AsiS I),

[0112] 6. Random nucleotide sequences that can serve as a Left Recombination Arm acceptor module (RNAS-LRA),

[0113] 7. A fixed grouping of non-variable rare restriction sites that define the 3′ portion of the Left Recombination Arm acceptor module (for example, PacI, MluI, and AscI),

[0114] 8. A unique HE site (for example, I-Ceu I (forward orientation)),

[0115] 9. A pair of non-variable and unique, common restriction sites flanking a random nucleotide sequence of DNA that can serve as a chromatin modification domain acceptor module (RNAS-CMD-1) (for example, Kpn I and Avr II),

[0116] 10. A T7 primer site,

[0117] 11. A pair of unique BstX I sites in opposite orientation (wherein the variable nucleotide region in the BstX I recognition site is defined by nucleotides identical to the non-complementary tails generated by the ordering of two identical HE recognition sites arranged in reverse-complement orientation; for example, PI-SceI (forward orientation) and PI-SceI (reverse orientation)) flanking a random nucleotide sequence of DNA that can serve as a complex transgene acceptor module (RNAS-PE3-1),

[0118] 12. A pair of unique HE sites in opposite orientation flanking a random nucleotide sequence of DNA that can serve as a complex transgene acceptor module (RNAS-PE3-2) (for example, I-SceI (forward orientation) and I-SceI (reverse orientation)),

[0119] 13. A T3 primer site in reverse-orientation,

[0120] 14. A pair of non-variable and unique, common restriction sites flanking a random nucleotide sequence of DNA that can serve as a chromatin modification domain acceptor module (RNAS-CMD-2) (for example, Xho I and Nhe I),

[0121] 15. A unique HE site in reverse orientation that is identical to that in item 8 above,

[0122] 16. A fixed grouping of non-variable rare restriction sites that define the 5′ portion a Right Recombination Arm module (for example, SnaB I, Sal I, and Not I),

[0123] 17. Random nucleotide sequences that can serve as a Right Recombination Arm acceptor module (RNAS-RRA),

[0124] 18. A fixed grouping of non-variable rare restriction sites that define the 3′ portion of the Right Recombination Arm acceptor module (for example, Rsr II, Swa I, and BsiW I),

[0125] 19. A non-variable and unique, common restriction site that allows cloning of a shuttle vector module upstream of an HE pair (for example, BspE I),

[0126] 20. A pair of unique HE sites in opposite orientation flanking a random nucleotide sequence of DNA that can serve as a genome expression host selector gene acceptor module (RNAS-GEH-S2) (for example, PI-Psp I (forward orientation) and PI-Psp I (reverse orientation)),

[0127] 21. An M13 Forward primer site placed in reverse orientation,

[0128] 22. Three non-variable and unique common restriction sites that define a 3′ insertion site for the mutated pUC19 vector described above (for example, Pme I, Sap I, and BspH I).

[0129] Three cloning vector plasmids of the invention are known as Shuttle Vectors. The Shuttle Vectors, like the PE3 and Primary Docking Plasmids, are also constructed from a pUC19 backbone. Just like the PE3 and Primary Docking Plasmids, each Shuttle Vector has the same modifications to the pUC19 backbone listed as 1 through 6, above. The individual Shuttle Vectors (SV) are identified as Shuttle Vector Promoter/intron (P), Shuttle Vector Expression (E), and Shuttle Vector 3′Regulatory (3); henceforth SVP, SVE, and SV3, respectively. Each is described more fully below.

[0130] Shuttle Vector P (SVP)

[0131] SVP is a cloning vector plasmid that one can be used to prepare promoter and intron sequences for assembly into a transgene construct (FIG. 6). An example of an SVP Plasmid can comprise the following sequential elements in the MCS (FIG. 7), in the order listed:

[0132] 1. Two non-variable and unique, common restriction sites that define a 5′ insertion site for the mutated pUC19 vector described above (for example, AatII and BlpI),

[0133] 2. A T7 primer site,

[0134] 3. A non-variable and unique, common restriction site that allows efficient cloning of a shuttle vector module downstream of the T7 primer site (for example, Eco0109I),

[0135] 4. A fixed grouping of non-variable rare restriction sites that define the 5′ portion of the promoter module (for example, AsiSI and SgrAI),

[0136] 5. A variable MCS comprising any grouping of common or rare restriction sites that are unique to the shuttle vector (for example, the series of restriction sites illustrated in FIG. 7),

[0137] 6. A fixed grouping of non-variable rare restriction sites that define the 3′ portion of the promoter module (for example, PacI, AscI, and MluI)

[0138] 7. A non-variable and unique, common restriction site that allows efficient cloning of a shuttle vector module upstream of the T3 primer site (for example, BspEI)

[0139] 8. A reverse-orientation T3 primer site, and

[0140] 9. Two non-variable and unique, common restriction sites that define a 3′ insertion site for the mutated pUC19 vector described above (for example, PmeI and SapI).

[0141] Shuttle Vector E (SVE)

[0142] This is a cloning vector plasmid that can be used to prepare sequences to be expressed by the transgene for assembly into a transgene construct (FIG. 8). An example of an SVE plasmid can comprise the following sequential elements in the MCS (FIG. 9), in the order listed:

[0143] 1. Two non-variable and unique, common restriction sites that define a 5′ insertion site for the mutated pUC19 vector described above (for example, AatII and Blp \I),

[0144] 2. A T7 primer site,

[0145] 3. A non-variable and unique, common restriction site that allows efficient cloning of a shuttle vector module downstream of the T7 primer site (for example, Eco0109\I),

[0146] 4. A fixed grouping of non-variable rare restriction sites that define the 5′ portion of the expression module (for example, PacI, AscI, and MluI),

[0147] 5. A variable MCS consisting of any grouping of common or rare restriction sites that are unique to the shuttle vector (for example, the series of restriction sites illustrated in FIG. 9),

[0148] 6. A fixed grouping of non-variable rare restriction sites that define the 3′ portion of the expression module (for example, SnaBI, NotI, and SalI),

[0149] 7. A non-variable and unique, common restriction site that allows efficient cloning of a shuttle vector module upstream of the T3 primer site (for example, BspEI)

[0150] 8. A reverse-orientation T3 primer site, and

[0151] 9. Two non-variable and unique, common restriction sites that define a 3′ insertion site for the 30 mutated pUC19 vector described above (for example, PmeI and SapI).

[0152] Shuttle Vector 3 (SV3)

[0153] This is a cloning vector plasmid that can be used to prepare 3′ regulatory sequences for assembly into a transgene construct (FIG. 10). An example of an SV3 plasmid can comprise the following elements in the MCS (FIG. 11), in the order listed:

[0154] 1. Two non-variable and unique, common restriction sites that define a 5′ insertion site for the mutated pUC19 vector described above (for example, AatII and BlpI),

[0155] 2. A T7 primer site,

[0156] 3. A non-variable and unique, common restriction site that allows efficient cloning of a shuttle vector module downstream of the T7 primer (for example, Eco0109I),

[0157] 4. A fixed grouping of non-variable rare restriction sites that define the 5′ portion of the 3′ regulatory module (for example, SnaBI, NotI, and SalI),

[0158] 5. A variable MCS consisting of any grouping of common or rare restriction sites that are unique to the shuttle vector (for example, the series of restriction sites illustrated in FIG. 11),

[0159] 6. A fixed grouping of non-variable rare restriction sites that define the 3′ portion of the 3′ regulatory module (for example, SwaI, RsrII, and BsiWI),

[0160] 7. A non-variable and unique, non-rare restriction site that allows efficient cloning of a shuttle vector module upstream of the T3 primer site (for example, BspEI),

[0161] 8. A reverse-orientation T3 primer site, and

[0162] 9. Two non-variable and unique, non-rare restriction sites that define a 3′ insertion site for the mutated pUC19 vector described above (for example, PmeI and SapI).

[0163] While the present invention discloses methods for building transgenes in plasmid cloning vectors, similar methods can be used to build transgenes in larger extrachromosomal DNA molecules such as cosmids or artificial chromosomes, including bacterial artificial chromosomes (BAC). The wide variety of genetic elements that can be incorporated into the plasmid cloning vectors also allow transfer of the final transgene products into a wide variety of host organisms with little or no further manipulation.

[0164] As an example of the method of practicing the present invention, a transgene can be constructed containing these elements:

[0165] 1. Nucleotide sequences of the human promoter for surfactant protein C (SP—C),

[0166] 2. Sequences encoding the protein product of the mouse gene granulocyte-macrophage colony-stimulating factor-receptor beta c (GMRβc)

[0167] 3. Rabbit betaglobin intron sequences, and

[0168] 4. SV40 poly-A signal.

[0169] The SP—C sequences contain internal BamH1 sites, and can be released from its parental plasmid only with Not1 and EcoR1. GMRβc has an internal Not1 site, and can be cut from its parental plasmid with BamH1 and Xho1. The rabbit betaglobin intron sequences can be cut out of its parental plasmid with EcoR1. The SV-40 poly-A tail can be cut from its parental plasmid with Xho1 and Sac1. Because of redundancy of several of restriction sites, none of the parental plasmids can be used to assemble all the needed fragments.

[0170] The steps used to build the desired transgene in the PE3 Docking Plasmid invention are as follows.

[0171] 1. Since Not1 and PspOM1 generate compatible cohesive ends, the human SP—C promoter sequences are excised with Not1 and EcoR1 and cloned into the PspOM1 and EcoR1 sites of Shuttle Vector P. The product of this reaction is called pSVP-SPC

[0172] 2. Following propagation and recovery steps well known to those skilled in the art, the rabbit betaglobin intron sequences are cloned into the EcoR1 site of pSVP-SPC. Orientation of the intron in the resultant intermediate construct is verified by sequencing the product, called pSVP-SPC-rβG.

[0173] 3. The promoter and intron are excised and isolated as one contiguous fragment from pSVP-SPC-rβG using AsiS1 and Asc1. Concurrently, the PE3 Docking Plasmid is cut with AsiS1 and Asc1 in preparation for ligation with the promoter/intron segment. The promoter/intron fragment is ligated into the Docking Plasmid, propagated, and recovered.

[0174] 4. The Xho1 site of the GMRβc fragment is filled in to create a blunt 3′ end, using techniques well known to those skiilled in the art. It is then cloned into the BamH1 site and the blunt-ended Pvu2 site of pSVP-SPC-rβG. The resultant plasmid (pDP-SPC-GMRβc-rβG) was propagated and recovered.

[0175] 5. The final cloning step is the addition of the SV-40 Poly-A tail. The SV40-polyA fragment is cut out with Xho1 and Sac1, as is the recipient vector pDS1-SPC-GMRβc-rbβG. Both pieces of DNA are gel purified and recovered. A ligation mix is prepared with a 10:1 molar ratio of SV-40polyA to pDS1-SPC-GMRβc-rβG. The ligation products are propagated and harvested. The new plasmid, pDS1-SPC-GMRβc-rβG-pA contains all elements required for the transgene, including a unique restriction site at the 3′ end with which the entire pDS1-SPC-GMRβc-rβG-pA plasmid can be linearized for transfection into eukaryotic cells or microinjection into the pronucleus of a fertilized ovum.

1 10 1 3452 DNA Artificial Sequence Synthetic Construct 1 tgagcagcgg ataacaattt cacacaggaa acagctatga ccatgattac tctgtagcat 60 ctatgtcggg tgcggagaaa gaggtaatga aatggcannn nnnnnnnnnn nnnnnnnnnn 120 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 180 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnntgc catttcatta 360 cctctttctc cgcacccgac atagataggc cctgcgccgg cggcgatcgc nnnnnnnnnn 420 nnnnnnnnnn ttaattaaac gcgtggcgcg cctaactata acggtcctaa ggtagcgagg 480 taccgctggc cctagggtaa tacgactcac tatagggcca cataagtggn nnnnnnnnnn 540 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 660 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 720 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnc 780 cacttatgtg gtagggataa cagggtaatn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 840 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 900 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 960 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1020 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnna ttaccctgtt atccctatcc 1080 ctttagtgag ggttaattct cgaggcagga gctagctcgc taccttagga ccgttatagt 1140 tatacgtagt cgacgcggcc gcnnnnnnnn nnnnnnnnnn nncggtccga tttaaatcgt 1200 acgtccggat ggcaaacagc tattatgggt attatgggtn nnnnnnnnnn nnnnnnnnnn 1260 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1320 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1380 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1440 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnna cccataatac 1500 ccataatagc tgtttgccag ctacagagtt tactggccgt cgttttacaa cgtcgtgact 1560 gggaaaaccc tggcggttta aacgctcttc cgcttccttc atgtgagcaa aaggccagca 1620 aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc 1680 tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata 1740 aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc 1800 gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc 1860 acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga 1920 accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc 1980 ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag 2040 gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag 2100 gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag 2160 ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca 2220 gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga 2280 cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgataacta tgactctctt 2340 aaggtagcca aattcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa 2400 aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa 2460 tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc 2520 tgactccccg tggtgtagat aactacgata cgggagggct taccatctgg ccccagtgct 2580 gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca 2640 gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt 2700 aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgtggtt 2760 gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc 2820 ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc 2880 tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt 2940 atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact 3000 ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc 3060 ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt 3120 ggaaagcgtt cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg 3180 atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct 3240 gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa 3300 tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt 3360 ctcatgcgcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc 3420 acatttcccc gaaaagtgcc acctgacgtc gc 3452 2 792 DNA Artificial Sequence PE3 Docking Plasmid MCS 2 gagagagaga cgtcgctgag caggccctgt aatacgactc actatagggg gcgccggagc 60 ttagggataa cagggtaatg gtaccnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 120 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 180 nnnnncctag ggcgatcgcc gccggcgnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300 nnnnnnntta attaaggcgc gccacgcgtn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 420 nnnnnnnnnt acgtagcggc cgcgtcgacg nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 480 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 540 nnnnnnnnnn atttaaatcg gtccgcgtac gcatatagct aacagcctcg agnnnnnnnn 600 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 660 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngctagcat taccctgtta tccctagccg 720 ctggcgcttc cctttagtga gggttaattt ccggagttta aacgctcttc cgcttccttc 780 atgagagaga ga 792 3 3452 DNA Artificial Sequence Primary Docking Plasmid 3 tgagcagcgg ataacaattt cacacaggaa acagctatga ccatgattac tctgtagcat 60 ctatgtcggg tgcggagaaa gaggtaatga aatggcannn nnnnnnnnnn nnnnnnnnnn 120 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 180 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnntgc catttcatta 360 cctctttctc cgcacccgac atagataggc cctgcgccgg cggcgatcgc nnnnnnnnnn 420 nnnnnnnnnn ttaattaaac gcgtggcgcg cctaactata acggtcctaa ggtagcgagg 480 taccgctggc cctagggtaa tacgactcac tatagggcca cataagtggn nnnnnnnnnn 540 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 660 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 720 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnc 780 cacttatgtg gtagggataa cagggtaatn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 840 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 900 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 960 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1020 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnna ttaccctgtt atccctatcc 1080 ctttagtgag ggttaattct cgaggcagga gctagctcgc taccttagga ccgttatagt 1140 tatacgtagt cgacgcggcc gcnnnnnnnn nnnnnnnnnn nncggtccga tttaaatcgt 1200 acgtccggat ggcaaacagc tattatgggt attatgggtn nnnnnnnnnn nnnnnnnnnn 1260 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1320 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1380 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1440 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnna cccataatac 1500 ccataatagc tgtttgccag ctacagagtt tactggccgt cgttttacaa cgtcgtgact 1560 gggaaaaccc tggcggttta aacgctcttc cgcttccttc atgtgagcaa aaggccagca 1620 aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc 1680 tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata 1740 aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc 1800 gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc 1860 acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga 1920 accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc 1980 ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag 2040 gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag 2100 gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag 2160 ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca 2220 gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga 2280 cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgataacta tgactctctt 2340 aaggtagcca aattcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa 2400 aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa 2460 tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc 2520 tgactccccg tggtgtagat aactacgata cgggagggct taccatctgg ccccagtgct 2580 gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca 2640 gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt 2700 aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgtggtt 2760 gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc 2820 ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc 2880 tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt 2940 atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact 3000 ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc 3060 ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt 3120 ggaaagcgtt cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg 3180 atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct 3240 gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa 3300 tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt 3360 ctcatgcgcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc 3420 acatttcccc gaaaagtgcc acctgacgtc gc 3452 4 1628 DNA Artificial Sequence Primary Docking Plasmid MCS 4 gagagagaga cgtcgctgag cagcggataa caatttcaca caggaaacag ctatgaccat 60 gattactctg tagcatctat gtcgggtgcg gagaaagagg taatgaaatg gcannnnnnn 120 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 180 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360 nnntgccatt tcattacctc tttctccgca cccgacatag ataggccctg cgccggcggc 420 gatcgcnnnn nnnnnnnnnn nnnnnnttaa ttaaacgcgt ggcgcgccta actataacgg 480 tcctaaggta gcgaggtacc gctggcccta gggtaatacg actcactata gggccacata 540 agtggnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 660 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 720 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 780 nnnnnnnnnn nnnnnccact tatgtggtag ggataacagg gtaatnnnnn nnnnnnnnnn 840 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 900 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 960 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1020 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnattac 1080 cctgttatcc ctatcccttt agtgagggtt aattctcgag gcaggagcta gctcgctacc 1140 ttaggaccgt tatagttata cgtagtcgac gcggccgcnn nnnnnnnnnn nnnnnnnncg 1200 gtccgattta aatcgtacgt ccggatggca aacagctatt atgggtatta tgggtnnnnn 1260 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1320 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1380 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1440 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1500 nnnnnaccca taatacccat aatagctgtt tgccagctac agagtttact ggccgtcgtt 1560 ttacaacgtc gtgactggga aaaccctggc ggtttaaacg ctcttccgct tccttcatga 1620 gagagaga 1628 5 2092 DNA Artificial Sequence SVP Plasmid 5 tgagcgtaat acgactcact atagggaggc cctgcgatcg ccgccggcgg atatcggagc 60 tgctgggccc agggagcttc tagaggagct ggatccgctg gagaattcgg agctggaaag 120 cttggagctg ctctgcaggg agctgcatgc gctggcgcac agctgttaat taaggcgcgc 180 cacgcgttcc ggattccctt tagtgagggt taattgttta aacgctcttc cgcttccttc 240 atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt 300 ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg 360 cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc 420 tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc 480 gtggcgcttt ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc 540 aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac 600 tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt 660 aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct 720 aactacggct acactagaag gacagtattt ggtatctgcg ctctgctgaa gccagttacc 780 ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt 840 ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg 900 atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc 960 atgataacta tgactctctt aaggtagcca aattcatgag attatcaaaa aggatcttca 1020 cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 1080 cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 1140 ttcgttcatc catagttgcc tgactccccg tggtgtagat aactacgata cgggagggct 1200 taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 1260 tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 1320 ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 1380 atagtttgcg caacgtggtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 1440 gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 1500 tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 1560 cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 1620 taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 1680 ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 1740 ctttaaaagt gctcatcatt ggaaagcgtt cttcggggcg aaaactctca aggatcttac 1800 cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 1860 ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 1920 gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 1980 gcatttatca gggttattgt ctcatgcgcg gatacatatt tgaatgtatt tagaaaaata 2040 aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc gc 2092 6 234 DNA Artificial Sequence SVP Plasmid MCS 6 cgctgagcgt aatacgactc actataggga ggccctgcga tcgccgccgg cggatatcgg 60 agctgctggg cccagggagc ttctagagga gctggatccg ctggagaatt cggagctgga 120 aagcttggag ctgctctgca gggagctgca tgcgctggcg cacagctgtt aattaaggcg 180 cgccacgcgt tccggattcc ctttagtgag ggttaattgt ttaaacgctc ttcc 234 7 2097 DNA Artificial Sequence SVE Plasmid 7 tgagcgtaat acgactcact atagggaggc cctgttaatt aaggcgcgcc acgcgtgata 60 tcggagctgc tgggcccagg gagcttctag aggagctgga tccgctggag aattcggagc 120 tggaaagctt ggagctgctc tgcagggagc tgcatgcgct ggcgcacagc tgtacgtagc 180 ggccgcgtcg actccggatt ccctttagtg agggttaatt gtttaaacgc tcttccgctt 240 ccttcatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg 300 cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga 360 ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 420 tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg 480 gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc 540 gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 600 gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca 660 ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 720 ggcctaacta cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag 780 ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 840 gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc 900 ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt 960 tggtcatgat aactatgact ctcttaaggt agccaaattc atgagattat caaaaaggat 1020 cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga 1080 gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg 1140 tctatttcgt tcatccatag ttgcctgact ccccgtggtg tagataacta cgatacggga 1200 gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc 1260 agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac 1320 tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc 1380 agttaatagt ttgcgcaacg tggttgccat tgctacaggc atcgtggtgt cacgctcgtc 1440 gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc 1500 catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt 1560 ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc 1620 atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg 1680 tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag 1740 cagaacttta aaagtgctca tcattggaaa gcgttcttcg gggcgaaaac tctcaaggat 1800 cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc 1860 atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa 1920 aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta 1980 ttgaagcatt tatcagggtt attgtctcat gcgcggatac atatttgaat gtatttagaa 2040 aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtcgc 2097 8 239 DNA Artificial Sequence SVE Plasmid MCS 8 cgctgagcgt aatacgactc actataggga ggccctgtta attaaggcgc gccacgcgtg 60 atatcggagc tgctgggccc agggagcttc tagaggagct ggatccgctg gagaattcgg 120 agctggaaag cttggagctg ctctgcaggg agctgcatgc gctggcgcac agctgtacgt 180 agcggccgcg tcgactccgg attcccttta gtgagggtta attgtttaaa cgctcttcc 239 9 2096 DNA Artificial Sequence SV3 Plasmid 9 tgagcgtaat acgactcact atagggaggc cctgtacgta gcggccgcgt cgacgatatc 60 ggagctgctg ggcccaggga gcttctagag gagctggatc cgctggagaa ttcggagctg 120 gaaagcttgg agctgctctg cagggagctg catgcgctgg cgcacagctg atttaaatcg 180 gtccgcgtac gtccggattc cctttagtga gggttaattg tttaaacgct cttccgcttc 240 cttcatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc 300 gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag 360 gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt 420 gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg 480 aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg 540 ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg 600 taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac 660 tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg 720 gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc tgaagccagt 780 taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg 840 tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc 900 tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt 960 ggtcatgata actatgactc tcttaaggta gccaaattca tgagattatc aaaaaggatc 1020 ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag 1080 taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 1140 ctatttcgtt catccatagt tgcctgactc cccgtggtgt agataactac gatacgggag 1200 ggcttaccat ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca 1260 gatttatcag caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact 1320 ttatccgcct ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca 1380 gttaatagtt tgcgcaacgt ggttgccatt gctacaggca tcgtggtgtc acgctcgtcg 1440 tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc 1500 atgttgtgca aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg 1560 gccgcagtgt tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca 1620 tccgtaagat gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt 1680 atgcggcgac cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc 1740 agaactttaa aagtgctcat cattggaaag cgttcttcgg ggcgaaaact ctcaaggatc 1800 ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca 1860 tcttttactt tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa 1920 aagggaataa gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat 1980 tgaagcattt atcagggtta ttgtctcatg cgcggataca tatttgaatg tatttagaaa 2040 aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga cgtcgc 2096 10 238 DNA Artificial Sequence SV3 Plasmid MCS 10 cgctgagcgt aatacgactc actataggga ggccctgtac gtagcggccg cgtcgacgat 60 atcggagctg ctgggcccag ggagcttcta gaggagctgg atccgctgga gaattcggag 120 ctggaaagct tggagctgct ctgcagggag ctgcatgcgc tggcgcacag ctgatttaaa 180 tcggtccgcg tacgtccgga ttccctttag tgagggttaa ttgtttaaac gctcttcc 238 

What is claimed is:
 1. A cloning vector plasmid, wherein sequential elements encoding a modular structure comprise a. three non-variable and unique common restriction sites, b. a 5′ oligonucleotide primer site, c. a unique HE site in a forward orientation, d. a pair of non-variable and unique, common restriction sites flanking random nucleotide sequences, e. a fixed grouping of non-variable rare restriction sites to define a 5′ portion of a promoter module, f. random nucleotide sequences, g. a fixed grouping of non-variable rare restriction sites that define a shared junction between a 3′ position relative to the Promoter/intron module and a 5′ position relative to an Expression module, h. random nucleotide sequences, i. a fixed grouping of non-variable rare restriction sites that define a junction of a 3′ position relative to the Expression module and a 5′ position relative to a 3′ Regulatory module, j. random nucleotide sequences, k. a fixed grouping of non-variable rare restriction sites that define a 3′ position relative to a 3′ Regulatory module, l. a pair of non-variable and unique, common restriction sites flanking random nucleotide sequences, m. a unique HE site in reverse orientation that is the same HE site as that placed 3′ of the 5′ oligonucleotide primer site, n. a 3′ oligonucleotide primer site in reverse orientation, and o. four non-variable and unique common restriction sites that define a 3′ insertion site.
 2. The cloning vector plasmid of claim 1, wherein the modular structure of the sequential elements is ordered to allow introduction of three discrete regions of nucleotide sequences, such that the nucleotide sequences so introduced replace the random nucleotide sequences in the Promoter, Expression, and 3′ Regulatory modules.
 3. A cloning vector plasmid, wherein sequential elements encoded comprise a. two non-variable and unique common restriction sites that define a 5′ insertion site, b. an oligonucleotide primer site, c. a pair of unique HE sites in opposite orientation flanking random nucleotide sequences, d. a non-variable and unique, common restriction site that allows cloning of a shuttle vector module downstream of the pair of unique HE sites, e. a fixed grouping of non-variable rare restriction sites, f. random nucleotide sequences, g. a fixed grouping of non-variable rare restriction sites, h. a unique HE site in a forward orientation, i. a pair of non-variable and unique, common restriction sites flanking random nucleotide sequences, j. an oligonucleotide primer site, k. a pair of unique BstX I sites in opposite orientations (wherein the variable nucleotide region in the BstX I recognition site is defined by nucleotides identical to the non-complementary tails generated by the ordering of two identical HE recognition sites arranged in reverse-complement orientation, l. a pair of unique HE sites in opposite orientations flanking random nucleotide sequences m. an oligonucleotide primer site in reverse-orientation, n. a pair of non-variable and unique, common restriction sites flanking random nucleotide sequences, o. a unique HE site in reverse orientation, with the HE site being the same as the HE site in a forward orientation, p. a fixed grouping of non-variable rare restriction sites, q. random nucleotide sequences, r. a fixed grouping of non-variable rare restriction sites, s. a non-variable and unique, common restriction site, t. a pair of unique HE sites in opposite orientation flanking random nucleotide sequences, u. an oligonucleotide primer site in reverse orientation, v. three non-variable and unique common restriction sites.
 4. The cloning vector plasmid of claim 3, wherein the modular structure of the sequential elements is ordered to allow introduction of two discrete transgenes, such that the nucleotide sequences so introduced replace random nucleotide sequences.
 5. The cloning vector plasmid of claim 3, wherein the modular structure of the sequential elements is ordered to allow introduction of two discrete regions of nucleotide sequences such as are needed to induce homologous recombination of a transgene to achieve targeted mutation of a gene.
 6. The cloning vector plasmid of claim 3, wherein the modular structure of the sequential elements is ordered to allow introduction of two distinct positive or negative selection elements.
 7. A method for constructing a transgene, comprising the steps of a. providing a cloning vector plasmid comprising a sequential arrangement of Promoter, Expression, and 3′ Regulatory modules, b. introducing first nucleotide sequences to be included in the transgene into a shuttle vector, c. transferring the first nucleotide sequences from the shuttle vector to the cloning vector plasmid comprising modular elements, d. sequentially introducing any other desired nucleotide sequences into another shuttle vector, e. and transferring the other desired nucleotide sequences into the cloning vector plasmid which received the first nucleotide sequences. 